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Non- African populations have experienced major bottlenecks in the time since their split 
from West Africans, which has led to the hypothesis that natural selection to remove 
weakly deleterious mutations may have been less effective in non-Africans. To directly test 
this hypothesis, we measure the per-genome accumulation of deleterious mutations across 
diverse humans. We fail to detect any significant differences, but find that archaic 
Denisovans accumulated non-synonymous mutations at a higher rate than modern 
humans, consistent with the longer separation time of modern and archaic humans. We 
also revisit the empirical patterns that have been interpreted as evidence for less effective 
removal of deleterious mutations in non- Africans than in West Africans, and show they are 
not driven by differences in selection after population separation, but by neutral evolution. 

The effectiveness with which natural selection removes deleterious mutations from a population 
depends not only on the selection coefficient (s) of a mutation, but on the product of this and the 
population size (AO (Ns, reviewed in 1). Thus, the rate at which deleterious alleles are removed 
from a population depends on demographic history. Profound demographic differences across 
humans are well documented. Founder events in the last hundred thousand years have reduced 
the nucleotide diversity (the number of differences per base pair between an individual's two 
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chromosomes) in non-Africans by at least 20% relative to West Africans ~ , reflecting times 



Downloaded from http://biorxiv.org/on September 18, 2014 



when the ancestors of non-Africans had relatively smaller population sizes. Similarly, the advent 
of agriculture in the last ten thousand years has led to rapid population expansions. 

To investigate the empirical evidence for natural selection having differed in its effectiveness 
across human populations, past studies have contrasted mutation classes thought to be subject to 
little selection (synonymous mutations in genes) to those potentially subject to purifying 
selection (non-synonymous mutations) 5 " 7 . The most important study on this topic measured the 
proportion of polymorphic positions in genes that are non-synonymous in 20 European and 15 
African American samples, and showed that while all such segregating sites have a reduced rate 
in non-Africans, the reduction is proportionally less for non-synonymous sites. 5 Based on 
simulations, this observation was interpreted as being consistent with reduced effectiveness of 
selection against weakly deleterious alleles in European than in West African populations due to 
their smaller size since separation 5 . Subsequent studies have confirmed the empirical observation 
and favored a similar interpretation 6 ' 8 ' 9 . These studies have clearly documented that there has 
been an interaction between the forces of natural selection and demographic history with regard 
to their effects on densities of non-synonymous sites 5 ' 6 ' 8 ' 9 . However, such observations do not 
necessarily imply that there have been differences in the effectiveness of selection between the 
two populations after the split. An alternative explanation is that in the common ancestral 
population of Europeans and West Africans, the average derived frequency for non-synonymous 
alleles was lower than for synonymous alleles, as negative selection places downward pressure 
on the frequency of derived alleles 5 . The different initial distributions for the two classes of 
alleles would have responded differently to the bottleneck that then occurred in European 
populations, simply because they started out with different shapes, an effect that would be 
expected to occur even in the absence of any differences in the effectiveness of selection 
between the two populations since they split. 

The most direct way to contrast the effectiveness of selection between two populations is to 
sample a single haploid genome from each population, count all the differences, and measure 
which of the two lineages carries an excess. Any two lineages that are compared in this manner 
must, by definition, be separated by the same amount of time since their common ancestor, and 
are thus expected to harbor the same number of lineage-specific mutations in the absence of 
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selection or differences in mutation rate. In the presence of selection, however, mutations are 
removed from each lineage at a rate that depends on the product Ns, such that differences in the 
effectiveness of selection can be inferred from a detected asymmetry in the number of mutations 
in the two lineages. Here, we test for differences in the accumulation of mutations between two 
lineages by sampling one allele from population X and one from Y, determining the ancestral 
state based on the chimpanzee genome (PanTroT), and recording all the differences. We count 
the number of derived mutations in population X but not Y (Lxnot r) and in 7 but not X (L Y „ 0 tx ), 
and define a statistic Rx/y = Lxnot y I L Yno tx- We average over all possible pairs of samples, and 
compute a standard error using a Weighted Block Jackknife to correct for correlation among 
neighboring sites (Methods) 10 . If selection has been equally effective since the split and mutation 
rates have been the same on the two lineages, Rx/y should be within a few standard errors from 1. 

We measured RwestAfrica/Eumpe in four sequencing datasets: (1) coding regions of genes (exomes) 
from 15 African Americans and 20 European Americans 5 ; (2) exomes from 1,089 individuals in 
the 1000 Genomes Project (1KG) 11 ; (3) exomes from 1,088 African Americans and 1,351 
European Americans 6 , and (4) 24 whole genomes sequenced to high coverage ' (Table SI). As 
expected for sites unaffected by selection and for mutation rates being indistinguishable on the 
two lineages, RwestAflica/EwopJisynonymous) is always within 2 standard errors of 1 (Table 1 and 
Table SI). However, RwesUflica/Europe(non-synonymous) is also indistinguishable from 1 (Table 1, 
Table 2, and Table S2). Thus, our empirical data provide no evidence that purging of weakly 
deleterious mutations has been less effective in Europeans than in West- Africans. This null result 
is consistent with the recently reported finding of ref. 14 for similar population comparisons, 
which used a somewhat different approach. To generalize these results, we extended the analysis 
to a diverse set of human populations by computing Rx/y between all possible pairs of 5 diverse 
sub-Saharan African and 6 non-African populations 13 , and fourteen 1000 Genomes Project 
populations. We observe no significant differences for any pair despite profound differences in 
demographic history (Table 2 and Table S3). 

To interpret these null findings, we carried out simulations using fitted models of the 
demographic histories of West African and European populations 5 ' 6 ' 15 . (For most of our 
simulations, we assumed that mutations act additively although in Figure SI we show that 
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qualitatively different patterns are expected for mutations that act recessively, a phenomenon that 
we explore in a separate study 16 .) The simulations predict that RwestAfHca/Eumpe will be below 0.95 
when s is between -0.004 and -0.0004 (Figure 1). However, if many mutations have selection 
coefficients outside this range, the signal could be diluted to the point of not being detectable. 
Indeed, when we compute the expected value of RwestAjHca/Europe integrating over a previously 
fitted distribution of selection coefficients 17 , we find that R W estAfrica/Europe(non-synonymous) is 
expected to be 0.987, too close to 1 to be easily detectable given the standard errors of our 
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empirical measurements (Table 1) . We next attempted to boost our power by stratifying non- 
synonymous sites based on their predicted functional effects. The PolyPhen-2 19 and SIFT 20 
algorithms both predict function in a way that is dependent on the ancestral/derived status of 
allelic variants compared with the human reference genome, which has a particular ancestry at 
every segment that can bias measurements. We thus implemented a version of PolyPhen-2 that is 
independent of the allelic status of the human reference (Methods). We continue to observe no 
significant differences 18 (Table 1, Table 2, Table S2, Table S3), although this null result may 
also reflect low power as when we infer the distribution of newly arising mutations for different 
PolyPhen-2 classes (Note SI), RwestAfnca/Eumpe is predicted to be 0.984-0.993 (Table S4). 

To increase statistical power to detect any real differences in the net effectiveness of selection, 
we leveraged the fact that the population split between African and non-African populations 
occurred only in the last roughly one hundred thousand years. Mutations that arose prior to the 
population divergence are expected to dilute any true signal. We therefore computed a time- 
stratified i?4/-n C a«/jjoM-4/i-/co«-statistic, taking advantage of data from 4 experimentally phased 
African and 6 experimentally phased non-African genomes, all processed similarly 13 . We created 
pseudo-diploid genomes by merging 1 African and 1 non- African phased genome, for a total of 
96 = (4x2)x(6 x 2) comparisons, and then used the Pairwise Sequential Markovian Coalescent 
method (PSMC) to infer the local time since the most recent common ancestor (TMRCA) at each 
location, masking out the exome to avoid circularly using the same sites for inferring TMRCA 
and computing RAfrican/non-African- Table S5 shows that when we restrict to the subset of the genome 
with the lowest inferred TMRCA in this analysis, we still detect no differences. 
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As a third way to increase statistical power, we analyzed a class of sites that is much larger than 
the class of coding substitutions so that we can make measurements with much smaller standard 
errors; this is the class of sites affected by biased gene conversion (BGC). BGC is a selection- 
like process in which DNA repair acting on heterozygous sites in gene conversion tracts favors 

2 1 

GC over AT alleles . Since BGC only acts on heterozygous sites, it occurs at a rate proportional 
to heterozygosity, or 2p(\-p) for a mutation of frequency p, exactly mimicking additive 
selection 21 . We find that R'^sf Africans, 'Europeans = 0-995 ± 0.002 and 

R'wett 'Africans! 'Europeans = i- 000 ± °- 002 by using an 

R ,ciassi^aass2 = R ciassi^ciass2 /( R a~t +R c~c^ statistic that normalizes by the rate of A^T 

and C^G substitutions not expected to be affected by BGC. A further advantage of this 
normalization is that it also corrects for possible differences in mutation rates across lineages. 
For diverse population comparisons, we detect no differences significant at |Z|>3 standard errors 
from zero with the exception of San Bushmen who have about 1% more GC— >AT mutations 
than other humans (significant at up to 8 standard errors) (Table S6). This is the first direct 
detection of less effective removal of mutations in some present-day humans than in others, and 
is consistent with the San being amongst the most deeply diverged human populations, which 
may have provided more opportunity for slight differences in the effectiveness of removal of 
mutations across populations to have a cumulatively measurable effect 22 . 



To demonstrate empirically that differences in the accumulation of non-synonymous sites can be 
empirically detected given a sufficiently ancient population divergence time and sufficiently 
different subsequent demographic histories, we also analyzed two deeply sequenced genomes 
from archaic humans: Denisovan and Neanderthal. The ancestors of both are inferred to have 
maintained relatively small effective population sizes for on the order of a half million years 
since their main separation from present-day humans, consistent their levels of genetic diversity 
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being 3-6 times smaller . A challenge in comparing the accumulation of mutations in present- 
day human samples to ancient samples is that fewer mutations are expected to have occurred on 
the lineage of ancient samples because they are from individuals who lived closer in time to the 
common ancestor. To correct for this, we divide the accumulation of non-synonymous mutations 
on each lineage by the synonymous sites: R ' x/Y(non-synonymous class) = Rx/y (non-synonymous 
class)/Rx/y(synonymous). After removing C— >T and G— >A mutations in the archaic genomes that 
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have evidence of degradation leading to ancient DNA errors, we find that present-day humans 
have accumulated deleterious mutations at a significantly lower rate than Denisovans since 
separation: R 1 Modem/DenUova(non-synonymous) = 0.872 ± 0.034 (P=0.0002) (Table S7) 12 . In 
contrast, R ' Modem/Neandenhai(non-synonymous) = 1.037 ± 0.037 is consistent with 1, suggesting that 
deleterious mutations have not been removed as effectively on the Neanderthal as on the 
Denisovan lineage (Table S7). The higher resolution BGC analysis further detects that the 
effectiveness of removal of mutations on the Neanderthal lineage was intermediate between that 

Of Denisovans and modern humans: R'^oruba/Denisova = °- 92 5 ± 0.004, R'yoruba/Neandertal = 

0.961 + 0.003, and R'oenisova/Neandertai = 1-041 + 0.005. The different rates of accumulation 
in Neanderthals and Denisovans despite similar inferred demographic histories (Figure 1) 
suggests that fitted models of demographic history, e.g. for West Africans and Europeans, may 
not always provide accurate predictions of the relative effectiveness of removal of mutations. 



In light of these results, is there any reason to believe that weakly deleterious mutations have 
been removed less effectively in Europeans than in West Africans? The strongest previous 
evidence for such an effect was based on an alternative statistic: the proportion of polymorphic 
sites in the exome that are non-synonymous 5 . We therefore carried out simulations of 
demographic history that allowed us to study the change in this statistic over time; our 
simulations agree with previous simulations of the same statistic 5 , but allow for additional 
insights into the evolutionary forces responsible for the dynamics (Figure 2). In the first set of 
simulations, we adjusted the selection coefficient s in Europeans in each generation so that the 
quantity governing the effectiveness of selection Ns was by construction always the same in 
Europeans and West Africans. The simulations show that the proportion of non-synonymous 
sites in Europeans in fact rises more, not less, when we adjust our simulations in this way to 
eliminate any differences in the effectiveness of selection (blue curves in Figure 2B). Thus, the 
observed rise in the proportion of non-synonymous sites in Europeans is not due to differences in 
the effectiveness of selection after the population split, but occurs in spite of such differences. In 
the second set of simulations, we partitioned the change in the proportion of non-synonymous 
sites over time into selection and neutral effects. The simulations show that the dynamics of the 
previously studied statistic are driven by neutral forces (correlation coefficient of p = 0.96 to 
0.99). In contrast, changes in the effectiveness with which selection removes deleterious alleles 
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in Europeans compared with West Africans has an effect that is opposite to the observed change 
(p = -0.45 to -0.27) (Figure 2C and Figure 2D). Intuitively, prior to the West African / European 
split, allele frequencies of non-synonymous polymorphisms would, on average, have been much 
lower due to the depletion of non-synonymous sites by selection, and the per-site density of non- 
synonymous segregating sites would also have been lower. The population entering the 
bottleneck primarily loses rare alleles, so the non-synonymous site allele frequency distribution 
would be expected to adjust faster than that for synonymous sites. Once the population re- 
expands, the allele frequency distribution for non-synonymous sites also adjusts faster, in this 
case because the same flux of new mutations into both classes causes a faster rate of 
replenishment of non-synonymous sites than synonymous sites due to an initially lower density. 
These findings illustrate the complexity of the interactions between selection and demographic 
history in their effects on genetic variation, and highlight an opportunity first identified by ref. 5 , 
which is that joint analyses of demographic history and natural selection can provide more 
insight into the nature of both evolutionary forces that either type of analysis alone. 

It is tempting to interpret the indistinguishable accumulations of deleterious mutations across 
present-day humans as implying that the overall genetic burden of disease should be similar for 
diverse humans. To the extent that mutations act additively this is correct, and it implies that the 
complex demographic events of the past are not expected to lead to substantial population 
differences in prevalence rates of complex disease that have an additive genetic architecture 18 . 
However, recessively or epistatically acting mutations work in combination to contribute to 
disease risk, and since demographic history affects allele frequencies, it affects the rate of co- 
occurrence of alleles. For example, Table 1 and Table S8 show that the absolute count of alleles 
occurring in homozygous form is empirically higher in non-Africans than in Africans for all 
functional site classes, confirming previous findings 5 . Thus, the relative risk for diseases that are 
contributed to by recessively acting mutations could be expected to be influenced by 
demography, as indeed is known to be the case for populations that experienced recent founder 
events like Ashkenazi Jews and Finns. An important direction for future work is to determine the 
extent to which mutations contributing to phenotypes act non-additively, as this will determine 
the extent to which demographic history is important in affecting human disease risk. 
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Methods 
Data 

The datasets we analyzed were published previously and are summarized here. 

"Ce/era": PCR amplification and Sanger sequencing was performed on 15 African 
American (AA) and 20 European American (EA) samples over the coding sequences of 10,150 
genes. We downloaded ancestral and derived allele counts for 39,440 autosomal SNPs from the 
supplementary materials of the original study, restricting to sites with genotypes available from 
both AA and EA 5 . 

" 1000 Genomes (1KG)": A total of 1,089 samples from 14 populations were analyzed in 
Phase 1 of the 1000 Genomes project. Illumina-based exome sequencing 11 was performed to 
~100 x average coverage after solution hybrid capture of the exome . 

"ESP": A total of 1,088 African Americans and 1,351 European Americans were 
sequenced as part of the National Heart, Lung and Blood Institute Exome Sequencing Project. 
Illumina-based exome sequencing was performed to ~100 x average coverage after solution 
hybrid capture of the exome 6 . 

"24 Genomes''': This dataset includes 2 samples each from 6 non-African and 5 sub- 
Saharan African genomes, an archaic human from Denisova Cave in Siberia sequenced to 30 x 
coverage, and an archaic Neanderthal from Denisova Cave in Siberia sequenced to 52* 
coverage. All sequencing data is based on Illumina technology. We used the version of this 
dataset reported in ref. 13 . We only analyzed sites with genotype quality scores (GQ) of >45 24 . 

Mutation annotation 

25 

We annotated coding mutations using ANNOVAR , which classifies sites as "non- 
synonymous", "synonymous", "stop-gain" or "stop-loss". We sub-classified variants using a 
simplified version of PolyPhen-2 that is independent of the ancestral/derived status of the human 
genome reference sequence ("human-free Polyphen-2"). To guarantee independence of 
PolyPhen-2 predictions from the human genome reference sequence, we created a simplified 
version that relies solely on the multi-species conservation score used in this method 26 . This 
score reflects the likelihood of observing a given amino acid at a site conditional on the observed 
pattern of amino acid changes in the phylogeny, and is the most informative feature of PolyPhen- 
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2. The predictions in our simplified PolyPhen-2 method are based on the absolute values of the 
difference of the scores for the two alleles. By construction, this is symmetric with respect to 
reference/non-reference (and also ancestral/derived, major/minor) allele status. This procedure is 
similar to the original version of PolyPhen, but relies on the PolyPhen-2 homology search and 
alignment pipeline. 



We are interested in the expected number of mutations in a randomly sampled haploid exome 
from one population that are not seen in a randomly sampled comparison exome from another 
population. To compute this in a situation where we have many exomes available from each 
population, we do not wish to literally randomly choose an exome from each population as this 
would throw away data decreasing the precision of our estimates. Instead, we obtain the expected 
value if we performed an infinite number of random samplings. To compute this, at each variable 
site i in the genome we define d l A as the count of the mutant allele at that site in a sample of 
n l A exomes from population A and d l B as the count of the mutant allele in a sample of n l B exomes 
from population B. The expectations are obtained by summing over all sites in the genome. 



For some analyses, we also wished to compute the relative probability that a population is 
homozygous for a derived allele whereas the other is not. Thus, we defined two additional 
statistics, now imposing a correction for limited sample size (since we need to sample two alleles 
from each population, we need to sample without replacement): 



Statistics 
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We then defined the ratio statistics 




9 



Downloaded from http://biorxiv.org/on September 18, 2014 



Weighted block jackknife to estimate standard errors 

We obtained standard errors using a weighted block jackknife 10 . We divided the SNP datasets 
into 100 contiguous blocks and then recomputed the statistic on all the data except for the data 
from that block. The variation can be converted to a standard error using jackknife theory. We 
assess significance based on the number of standard errors from the null expectation of R=l, and 
compute a P-value using a Z-score assuming a normal distribution. 

Time-stratified computation of relative accumulation of deleterious mutation 

We began with data from 10 experimentally phased genomes, all processed in a nearly identical 
way 13 . These genomes consisted of one each from the populations in Table 2 except for the 
Dinka. We then combined haploid genomes from one of 4 African and one of 6 non-African 
individual to make 96 = (2x4)x(2><6) pseudo-diploid individuals. We masked the data from the 
exome, and ran the Pairwise Sequential Markovian Analysis (PSMC) 2 on the data to estimate the 
time since the most recent common ancestor of the two phased genomes at each location in the 
genome. We stratified the data into three subsets of inferred time depth, and then computed the 
RAfrican/non-African-statistic within each time-stratified subset (using exomic sites that had been 
masked from the PSMC analysis so we could independently use them for analysis). 

Analysis of sites susceptible to biased gene conversion (BGC) 

We computed the accumulation of mutations susceptible to BGC for three different classes: 
GC— »AT mimicking negative additive selection, AT— ^GC mimicking positive additive selection, 
and A/T or G/C polymorphisms which we treat as neutral (and use as the denominator of R 'x/y)- 
For BGC analyses we use the entire genome, after excluding sites in the exome. For analyses 
involving ancient samples, we exclude C— >T and G— >A sites from the analysis of GC— >AT 
substitutions (we restrict to C— >A and G— >T substitutions), to avoid the degradation errors 
typical of ancient DNA that we are concerned may affect even the consensus genome sequences. 

R Ayj-statistic: Correcting for branch shortening and differences in mutation rates 

For analyses involving the archaic Denisovan and Neanderthal samples, which are many tens of 
thousands of years old and thus have experienced less evolution from the common ancestor than 
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present-day humans to which they are compared, we do not expect that L arc haic not modem = L mo d er n 
non archaic even for neutral sites. For all analyses involving ancient samples, we instead compute 
normalized statistics L A not b' and L B not a', where we divide both L A not b and L B no t a by the 
accumulation of mutations at sites that are expected to act neutrally (synonymous sites for coding 
sequences and A/T + C/G for BGC). Thus, L'auIIb = Lfnot b I ' L™™^ 2 ^™ l . We then define: 

n' class _ aclass iiclass \ / r i normalization / 1 normalizations _ n class /n normalization 
K X/Y — \. L XnotY/ L Ynotx)/\. L 'XnotY I L Y not X ) ~ n X/Y l n X/Y 

This R x/y-statistic not only corrects for branch shortening in the ancient samples, but also has the 
benefit of correcting for any differences in mutation rate that might have arisen on one lineage or 
another since population separation. 



Simulations 

We wrote a forward simulation in C that implements an infinite sites model. Each mutation is 
assumed to occur at an unlinked site. To the extent that linkage affects the expected values of the 
statistics we compute, our simulations are not capturing these effects. 

There is an initial burn-in period of 250,000 generations to generate an equilibrium allele 
frequency spectrum. The simulator samples the allele counts in the current generation based on 
the frequencies in the previous generation, the selection coefficient s, the dominance coefficient 
h (usually set to additive or h=0.5), and the current population size. 

For modeling West African and European history in the simulations reported in the main 
text, we use a demographic model previously fitted to genetic data 6 . (Figure SI reports results for 
other four different demographic histories, shown in Table S9). For comparisons of West African 
and archaic population history we also use a previously fitted demographic model . We use a 

Q 

mutation rate of 2x10" /base pair/generation. 

For analyses of the proportion of sites that are expected to be non-synonymous in a 
sample size of 40 chromosomes, we use a hypergeometric distribution to obtain the expected 
value of this statistic. If the population size in a generation is N and K t is the number of copies of 
the derived allele at site i, then we can compute the probability that a sample of 40 chromosomes 
is polymorphic at a site as 1 minus the hypergeometric probability of 0 or 40 derived alleles: 
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Probability that site i is segregating = 1 0 , M 40 0 , N ° — 

We average this probability over all simulated positions to obtain the density of segregating sites. 
Integrating over distributions of selection coefficients 

For some statistics, we wished to obtain an expected value integrating over distributions of 
selection coefficients. To achieve this, we carried out simulation series for different selection 
coefficients; for example, for Figure 2, each of 19 values: s = { -lxlO 6 , -2x1 0 6 , -5><10 6 , -l x 10 5 , - 
2xl0 5 , -5xl0 5 , -lxlO 4 , -2xl0 4 , -5xl0 4 , -lxlO 3 , -2xl0 3 , -5xl0 3 , -0.01, -0.02, -0.05, -0.1, -0.2, - 
0.5, -1}. To compute expected values for L Xno tY, Lrnotx, and the density of segregating sites per 
base pair in a fixed sample size of 40 chromosomes, we use a weighted average of the values of 
the simulated single selection coefficient statistics. For some analyses, we use the distribution of 
human selection coefficients for non-synonymous sites from ref. 17 , where the probability of a 
given value of -s is drawn from a gamma distribution fitted to European genetic data with 
a=0.206 and (3=15400. For analyses of the expected value of Rwesujnca/Europe stratified by 
PolyPhen-2 functional class, we use the values inferred in Note SI. 



Simulations forcing the effectiveness of selection in Europeans and West Africans to be the 
same 

To evaluate whether differences in the effectiveness of selection in the Europeans and West 
African populations since their split could be explaining the observed rise in the proportion of 
non-synonymous sites in Europeans above baseline, we modified the simulator so that in every 
generation i, the selection coefficient in Europeans s e j is determined dynamically. Define N e j and 
N a j as the diploid population sizes in Europeans and Africans, respectively, and define the 
selection coefficient in Africans (held constant) as s a j- We then set the selection coefficient in 
Europeans in each generation to be s e j = s a ,i(NiJN it e). This procedure has the consequence that 
N e ,iS e ,i = N e js a ,i(N a ,i/N e j) = N a ,iS a ,i . Since the quantity Ns governs the effectiveness of selection, 
selection is guaranteed to be equally effective in both populations at all times. 

Partitioning the evolutionary dynamics into the effects of selection and neutral effects 
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We modified the simulation to sample two counts of derived alleles in each generation for a 
given selection coefficient s and nucleotide position i. The first count is A s j„ which reflects the 
forces of selection, mutation, and stochastic sampling. The second is N s ,u which reflects the 
effects of mutation and stochastic sampling but not selection. The counts in each generation are 
only updated based on A s i _i in the previous generation, so the long-term evolutionary trajectory 
properly incorporates the effects of natural selection as they have accumulated over generations. 

We average the values of N s j and A s j over simulation replicates, and compute a weighted 
average of the results based on the distribution of selection coefficients from ref. 17 . We define 
the proportion of sites that are expected to be non-synonymous in a given generation as PropAll, 
= A Sti /(A s j+ Ao,t), and the proportion that would be non-synonymous if selection had been 
switched off in that generation as PropNeu, = N s j/(N Sti + Ao,i). 

With this notation, the expected change in the proportion of nonsynonymous sites due to 
all evolutionary forces in generation i is hPropAlli = PropAll r PropAlU. The proportion due to 
neutral forces only is SPropNeUj = PropNeu- PropAU^, and the proportion due to selective forces 
only is SPropSeh = PropAll- PropNeu/. 

To compare the effect of an evolutionary force in a given generation to what it was in the 
ancestral population prior to the split (>2,040 generations ago in the simulations of Figure 2), we defined 
the baseline-corrected statistics APropSelj = SPropSelj-SPropSel baseline, APropNeui = SPropNeut 
SPropNeubaseiine, and APropAlU = 8PropAlli-SPropAll hase i ine . These statistics are positive if the 
effectiveness of removal of mutations due to an evolutionary force is less than in the ancestral 
population, and negative if the effectiveness is greater than in the ancestral population. These are 
the statistics plotted in Figure 2C and integrated to obtain cumulative effects in Figure 2D. 
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Figure Legends 

Figure 1: The effect of demographic history on the accumulation of deleterious mutations.. 

To study the expected value of RwesujHca/Euwpe stratified by selection coefficient, we simulated a 
previously published model of the joint history of West Africans and Europeans 6 , for a range of 
selection coefficients, assuming an additive model of nature selection (Figure SI shows similar 
results for other demographic models). The simulations show that RwestAfHca/Europe dips below a 
potentially detectable ratio of 0.95 fors G (—0.0004,-0.004). We also simulated a published 
model of the history of archaic Denisovans, archaic Neanderthals, and West Africans 13 . The 
simulations predicts similar curves for R westAfrica/Denisova and R WestAfrica/Neanderthai reflecting their 
similar inferred demographic histories (we use a normalized R ' statistic to correct for the effects 
of branch shortening in these ancient lineages). The simulations show that R'westAfrica/Denisova is 
expected to below a detectable ratio of 0.95 for s G (-0.00002, -0.03) and that 
R 'westAfrica/Neanderthai is expected to be below 0.95 for s G (—0.00002, —0.09). 

Figure 2: The rise in the proportion of non-synonymous sites in Europeans compared with 
West Africans is not due to a reduced effectiveness of selection in Europeans since the split. 

(A) The West African and European diploid population sizes for the two simulated models, both 
of which specify a population split 2,040 generations ago. The plots of the temporal dynamics of 
the statistics in subsequent panels are restricted to the European population, as the West African 
population size does not fluctuate enough to result in appreciable changes from the baseline. (B) 
We show the values of key statistics as a fraction of the baseline. The present-day proportion of 
non-synonymous sites in Europeans is higher than in the ancestral populations (black curves). 
This cannot be attributed to less effective removal of deleterious mutations in Europeans than in 
Africans since the population split, as we can see from the fact that when we carry out 
simulations in which the selection coefficient in Europeans per generation is set so that the 
effectiveness of selection Ns in the two populations is the same, the rise actually becomes greater 
(blue curves). (C) Partitioning of the change in the proportion of non-synonymous sites per 
generation into neutral and selective forces shows that for the West African / European 
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comparison, the temporal dynamics are driven by neutral forces (strong positive correlation in 
the dynamics) and not by the selective forces (negative correlation). (D) Plots of the cumulative 
effect of each evolutionary force compared to baseline show that differences in the effectiveness 
of selection between Europeans and West Africans since they separated in fact have a net 
negative, not a net positive effect on the proportion of non-synonymous sites in Europeans. 
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Table 1: Accumulation of different classes of mutation in exomes of West African compared to exomes of European ancestry 
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Notes: ±1 standard errors are obtained from a Block Jackknife with 100 equally sized blocks. For the whole genomes, Yoruba+Mandenka represent West 
Africans, and French+Sardinian represent Europeans. For the 1000 Genomes Data (1KG), YRI represent West Africans and CEU Europeans. The results for the 
Celera and ESP analyses represent people of West African ancestry using African Americans. 



Table 2: Rx/Y(probably damaging) for all pairs of deep genome populations (bottom left) and 1000 Genomes populations (top right) 
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Notes: ±1 standard errors (parentheses) are based on a Block Jackknife with 100 equally sized blocks. We highlight numbers >4 standard errors from expectation. 
Ratios for Neanderthal and Denisova are normalized by the number of synonymous sites on each lineage, to adjust for the expectation of fewer mutations in the 
ancient samples than in the present-day human samples due to less time elapsed since divergence (all other comparisons are un-normalized). Ratios are based on 
the accumulation observed in the population in the row divided by that in the population in the column. Numbers in parentheses are sample sizes. 
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